Overview

Dataset Statistics

Number of Variables 45
Number of Rows 1.3716e+06
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 580.8 MB
Average Row Size in Memory 444.0 B
Variable Types
  • Numerical: 31
  • Categorical: 14

Dataset Insights

brand_clicks and brand_click_ratio have similar distributions Similar Distribution
price and total_amount have similar distributions Similar Distribution
price and total_amount_sum have similar distributions Similar Distribution
price and time_diff have similar distributions Similar Distribution
quantity and session_buy_count have similar distributions Similar Distribution
quantity and conversion_rate have similar distributions Similar Distribution
buy_hour and buy_minute have similar distributions Similar Distribution
buy_hour and buy_day have similar distributions Similar Distribution
buy_hour and session_buy_count have similar distributions Similar Distribution
buy_minute and buy_day have similar distributions Similar Distribution
total_amount and total_amount_sum have similar distributions Similar Distribution
total_amount and time_diff have similar distributions Similar Distribution
item_id is skewed Skewed
click_day is skewed Skewed
click_order is skewed Skewed
session_click_count is skewed Skewed
avg_click_interval is skewed Skewed
item_click_count is skewed Skewed
category_click_count is skewed Skewed
category_click_ratio is skewed Skewed
unique_category_count is skewed Skewed
special_offer_clicks is skewed Skewed
missing_category_clicks is skewed Skewed
brand_clicks is skewed Skewed
main_category_clicks is skewed Skewed
special_offer_click_ratio is skewed Skewed
missing_category_click_ratio is skewed Skewed
brand_click_ratio is skewed Skewed
main_category_click_ratio is skewed Skewed
price is skewed Skewed
quantity is skewed Skewed
buy_hour is skewed Skewed
buy_minute is skewed Skewed
buy_day is skewed Skewed
total_amount is skewed Skewed
session_buy_count is skewed Skewed
total_amount_sum is skewed Skewed
time_diff is skewed Skewed
avg_time_diff is skewed Skewed
conversion_rate is skewed Skewed
category has a high cardinality: 83 distinct values High Cardinality
category_type has a high cardinality: 83 distinct values High Cardinality
click_month has constant value "8" Constant
click_dayofweek has constant length 1 Constant Length
click_is_weekend has constant length 1 Constant Length
click_month has constant length 1 Constant Length
is_special_offer has constant length 1 Constant Length
is_missing_category has constant length 1 Constant Length
is_brand has constant length 1 Constant Length
is_main_category has constant length 1 Constant Length
buy_dayofweek has constant length 3 Constant Length
buy_is_weekend has constant length 3 Constant Length
buy_month has constant length 3 Constant Length
conversion has constant length 1 Constant Length
conversion_session has constant length 1 Constant Length
avg_click_interval has 103999 (7.58%) zeros Zeros
special_offer_clicks has 223673 (16.31%) zeros Zeros
missing_category_clicks has 1190130 (86.77%) zeros Zeros
brand_clicks has 1367264 (99.68%) zeros Zeros
main_category_clicks has 813145 (59.28%) zeros Zeros
special_offer_click_ratio has 223673 (16.31%) zeros Zeros
missing_category_click_ratio has 1190130 (86.77%) zeros Zeros
brand_click_ratio has 1367264 (99.68%) zeros Zeros
main_category_click_ratio has 813145 (59.28%) zeros Zeros
price has 1327538 (96.78%) zeros Zeros
quantity has 1327538 (96.78%) zeros Zeros
buy_hour has 1320833 (96.3%) zeros Zeros
buy_minute has 1321566 (96.35%) zeros Zeros
buy_day has 1320759 (96.29%) zeros Zeros
total_amount has 1327538 (96.78%) zeros Zeros
session_buy_count has 1320759 (96.29%) zeros Zeros
total_amount_sum has 1327538 (96.78%) zeros Zeros
time_diff has 1320759 (96.29%) zeros Zeros
avg_time_diff has 1251762 (91.26%) zeros Zeros
conversion_rate has 1320759 (96.29%) zeros Zeros
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8

Variables


session_id

numerical

Approximate Distinct Count 477673
Approximate Unique (%) 34.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16459668
Mean 8.2304e+06
Minimum 7930294
Maximum 8527384
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • session_id is skewed left (γ1 = -0.0032)

Quantile Statistics

Minimum 7930294
5-th Percentile 7.9593e+06
Q1 8.081e+06
Median 8.2297e+06
Q3 8.3781e+06
95-th Percentile 8.497e+06
Maximum 8527384
Range 597090
IQR 297120

Descriptive Statistics

Mean 8.2304e+06
Standard Deviation 172049.807
Variance 2.9601e+10
Sum 1.1289e+13
Skewness -0.003237
Kurtosis -1.1917
Coefficient of Variation 0.0209

item_id

numerical

Approximate Distinct Count 23466
Approximate Unique (%) 1.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16459668
Mean 2.1477e+08
Minimum 214507331
Maximum 214988439
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • item_id is skewed left (γ1 = -0.9451)

Quantile Statistics

Minimum 214507331
5-th Percentile 2.1455e+08
Q1 2.147e+08
Median 2.1484e+08
Q3 2.1485e+08
95-th Percentile 2.1485e+08
Maximum 214988439
Range 481108
IQR 154281

Descriptive Statistics

Mean 2.1477e+08
Standard Deviation 110008.1159
Variance 1.2102e+10
Sum 2.9458e+14
Skewness -0.9451
Kurtosis -0.5946
Coefficient of Variation 0.00051222
  • item_id is not normally distributed (p-value 8.219926632128322e-24)

category

categorical

Approximate Distinct Count 83
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90543853
  • The largest value (S) is over 8.28 times larger than the second largest value (1)

Length

Mean 1.0114
Standard Deviation 0.1807
Median 1
Minimum 1
Maximum 10

Sample

1st row 3
2nd row 0
3rd row 0
4th row 0
5th row S

Letter

Count 951651
Lowercase Letter 0
Space Separator 0
Uppercase Letter 951651
Dash Punctuation 0
Decimal Number 435667
  • The top 2 categories (S, 1) take over 50.0%
  • The largest value (s) is over 8.28 times larger than the second largest value (1)

click_hour

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 13.2903
Minimum 0
Maximum 23
Zeros 6581
Zeros (%) 0.5%
Negatives 0
Negatives (%) 0.0%
  • click_hour is skewed left (γ1 = -0.1417)

Quantile Statistics

Minimum 0
5-th Percentile 5
Q1 9
Median 13
Q3 18
95-th Percentile 21
Maximum 23
Range 23
IQR 9

Descriptive Statistics

Mean 13.2903
Standard Deviation 5.0698
Variance 25.7027
Sum 1.8229e+07
Skewness -0.1417
Kurtosis -0.9216
Coefficient of Variation 0.3815
  • click_hour is not normally distributed (p-value 0.0006817726695701307)

click_minute

numerical

Approximate Distinct Count 60
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 29.5671
Minimum 0
Maximum 59
Zeros 22362
Zeros (%) 1.6%
Negatives 0
Negatives (%) 0.0%
  • click_minute is skewed left (γ1 = -0.0051)

Quantile Statistics

Minimum 0
5-th Percentile 3
Q1 15
Median 30
Q3 45
95-th Percentile 57
Maximum 59
Range 59
IQR 30

Descriptive Statistics

Mean 29.5671
Standard Deviation 17.3163
Variance 299.8546
Sum 4.0555e+07
Skewness -0.005124
Kurtosis -1.2026
Coefficient of Variation 0.5857
  • click_minute is not normally distributed (p-value 0.0004534334435910105)

click_day

numerical

Approximate Distinct Count 15
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 14.8711
Minimum 5
Maximum 19
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • click_day is skewed left (γ1 = -1.0804)

Quantile Statistics

Minimum 5
5-th Percentile 9
Q1 13
Median 15
Q3 17
95-th Percentile 18
Maximum 19
Range 14
IQR 4

Descriptive Statistics

Mean 14.8711
Standard Deviation 2.8978
Variance 8.3972
Sum 2.0398e+07
Skewness -1.0804
Kurtosis 0.9034
Coefficient of Variation 0.1949
  • click_day is not normally distributed (p-value 2.1585111198805647e-10)
  • click_day has 24462 outliers

click_dayofweek

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 5
2nd row 5
3rd row 5
4th row 5
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • click_dayofweek has words of constant length

click_is_weekend

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (0) is over 1.94 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 1.94 times larger than the second largest value (1)
  • click_is_weekend has words of constant length

click_month

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 8
2nd row 8
3rd row 8
4th row 8
5th row 8

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • click_month has words of constant length

category_type

categorical

Approximate Distinct Count 83
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 105582283
  • The largest value (special_offer) is over 8.28 times larger than the second largest value (category_1)

Length

Mean 11.9753
Standard Deviation 1.6468
Median 13
Minimum 7
Maximum 16

Sample

1st row category_3
2nd row missing
3rd row missing
4th row missing
5th row special_offer

Letter

Count 14725136
Lowercase Letter 14725136
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 382320
  • The top 2 categories (special_offer, category_1) take over 50.0%
  • The largest value (special_offer) is over 8.28 times larger than the second largest value (category_1)

click_order

numerical

Approximate Distinct Count 198
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 4.106
Minimum 1
Maximum 198
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • click_order is skewed right (γ1 = 7.9876)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 2
Q3 4
95-th Percentile 14
Maximum 198
Range 197
IQR 3

Descriptive Statistics

Mean 4.106
Standard Deviation 6.6796
Variance 44.6169
Sum 5.6319e+06
Skewness 7.9876
Kurtosis 113.2609
Coefficient of Variation 1.6268
  • click_order is not normally distributed (p-value 2.700504633313486e-24)
  • click_order has 140242 outliers

session_click_count

numerical

Approximate Distinct Count 120
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 7.5791
Minimum 1
Maximum 200
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • session_click_count is skewed right (γ1 = 6.4292)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 4
Q3 8
95-th Percentile 26
Maximum 200
Range 199
IQR 6

Descriptive Statistics

Mean 7.5791
Standard Deviation 11.2298
Variance 126.1076
Sum 1.0396e+07
Skewness 6.4292
Kurtosis 67.649
Coefficient of Variation 1.4817
  • session_click_count is not normally distributed (p-value 4.8209354904943845e-21)
  • session_click_count has 116364 outliers

avg_click_interval

numerical

Approximate Distinct Count 297072
Approximate Unique (%) 21.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 140.0084
Minimum 0
Maximum 3599.296
Zeros 103999
Zeros (%) 7.6%
Negatives 0
Negatives (%) 0.0%
  • avg_click_interval is skewed right (γ1 = 6.2726)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 41.8203
Median 79.4646
Q3 150.901
95-th Percentile 476.7503
Maximum 3599.296
Range 3599.296
IQR 109.0807

Descriptive Statistics

Mean 140.0084
Standard Deviation 235.1115
Variance 55277.4404
Sum 1.9204e+08
Skewness 6.2726
Kurtosis 57.2212
Coefficient of Variation 1.6793
  • avg_click_interval is not normally distributed (p-value 3.2212321637763447e-19)
  • avg_click_interval has 120998 outliers

item_click_count

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 1.2137
Minimum 1
Maximum 48
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • item_click_count is skewed right (γ1 = 5.7185)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 1
Q3 1
95-th Percentile 2
Maximum 48
Range 47
IQR 0

Descriptive Statistics

Mean 1.2137
Standard Deviation 0.6156
Variance 0.379
Sum 1.6648e+06
Skewness 5.7185
Kurtosis 86.4141
Coefficient of Variation 0.5072
  • item_click_count is not normally distributed (p-value 1.2408518743977642e-24)
  • item_click_count has 213757 outliers

category_click_count

numerical

Approximate Distinct Count 97
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 5.7233
Minimum 1
Maximum 147
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • category_click_count is skewed right (γ1 = 5.9891)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 3
Q3 7
95-th Percentile 17
Maximum 147
Range 146
IQR 5

Descriptive Statistics

Mean 5.7233
Standard Deviation 7.7297
Variance 59.7489
Sum 7.8503e+06
Skewness 5.9891
Kurtosis 59.4921
Coefficient of Variation 1.3506
  • category_click_count is not normally distributed (p-value 1.3477767810626691e-20)
  • category_click_count has 93067 outliers

category_click_ratio

numerical

Approximate Distinct Count 1052
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.8564
Minimum 0.005051
Maximum 1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • category_click_ratio is skewed left (γ1 = -1.5913)

Quantile Statistics

Minimum 0.005051
5-th Percentile 0.2857
Q1 0.7857
Median 1
Q3 1
95-th Percentile 1
Maximum 1
Range 0.9949
IQR 0.2143

Descriptive Statistics

Mean 0.8564
Standard Deviation 0.2483
Variance 0.06163
Sum 1.1747e+06
Skewness -1.5913
Kurtosis 1.2695
Coefficient of Variation 0.2899
  • category_click_ratio is not normally distributed (p-value 6.2008985135665895e-25)
  • category_click_ratio has 144218 outliers

unique_category_count

numerical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 1.4909
Minimum 1
Maximum 10
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • unique_category_count is skewed right (γ1 = 2.6713)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 1
Q3 2
95-th Percentile 3
Maximum 10
Range 9
IQR 1

Descriptive Statistics

Mean 1.4909
Standard Deviation 0.8977
Variance 0.8059
Sum 2.045e+06
Skewness 2.6713
Kurtosis 10.0963
Coefficient of Variation 0.6021
  • unique_category_count is not normally distributed (p-value 5.809262830185703e-23)
  • unique_category_count has 52478 outliers

is_special_offer

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (1) is over 2.27 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 2.27 times larger than the second largest value (0)
  • is_special_offer has words of constant length

is_missing_category

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (0) is over 24.71 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 1
4th row 1
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 24.71 times larger than the second largest value (1)
  • is_missing_category has words of constant length

is_brand

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (0) is over 852.54 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 852.54 times larger than the second largest value (1)
  • is_brand has words of constant length

is_main_category

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (0) is over 2.75 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 2.75 times larger than the second largest value (1)
  • is_main_category has words of constant length

special_offer_clicks

numerical

Approximate Distinct Count 91
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 4.8636
Minimum 0
Maximum 147
Zeros 223673
Zeros (%) 16.3%
Negatives 0
Negatives (%) 0.0%
  • special_offer_clicks is skewed right (γ1 = 6.1499)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 3
Q3 6
95-th Percentile 16
Maximum 147
Range 147
IQR 5

Descriptive Statistics

Mean 4.8636
Standard Deviation 7.3801
Variance 54.4662
Sum 6.6712e+06
Skewness 6.1499
Kurtosis 65.7932
Coefficient of Variation 1.5174
  • special_offer_clicks is not normally distributed (p-value 4.851285008287082e-19)
  • special_offer_clicks has 90784 outliers

missing_category_clicks

numerical

Approximate Distinct Count 60
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.5133
Minimum 0
Maximum 89
Zeros 1190130
Zeros (%) 86.8%
Negatives 0
Negatives (%) 0.0%
  • missing_category_clicks is skewed right (γ1 = 15.0603)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 2
Maximum 89
Range 89
IQR 0

Descriptive Statistics

Mean 0.5133
Standard Deviation 3.3127
Variance 10.9741
Sum 704074
Skewness 15.0603
Kurtosis 285.9951
Coefficient of Variation 6.4536
  • missing_category_clicks is not normally distributed (p-value 4.644043445110715e-25)
  • missing_category_clicks has 181509 outliers

brand_clicks

numerical

Approximate Distinct Count 13
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.005553
Minimum 0
Maximum 14
Zeros 1367264
Zeros (%) 99.7%
Negatives 0
Negatives (%) 0.0%
  • brand_clicks is skewed right (γ1 = 34.8065)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 14
Range 14
IQR 0

Descriptive Statistics

Mean 0.005553
Standard Deviation 0.1228
Variance 0.01508
Sum 7617
Skewness 34.8065
Kurtosis 1677.7245
Coefficient of Variation 22.1105
  • brand_clicks is not normally distributed (p-value 4.227520909168155e-25)

main_category_clicks

numerical

Approximate Distinct Count 71
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 2.2011
Minimum 0
Maximum 124
Zeros 813145
Zeros (%) 59.3%
Negatives 0
Negatives (%) 0.0%
  • main_category_clicks is skewed right (γ1 = 7.2561)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 2
95-th Percentile 11
Maximum 124
Range 124
IQR 2

Descriptive Statistics

Mean 2.2011
Standard Deviation 5.6368
Variance 31.7736
Sum 3.0192e+06
Skewness 7.2561
Kurtosis 91.6452
Coefficient of Variation 2.5609
  • main_category_clicks is not normally distributed (p-value 1.1461478215977371e-24)
  • main_category_clicks has 156125 outliers

special_offer_click_ratio

numerical

Approximate Distinct Count 691
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.6957
Minimum 0
Maximum 1
Zeros 223673
Zeros (%) 16.3%
Negatives 0
Negatives (%) 0.0%
  • special_offer_click_ratio is skewed left (γ1 = -0.8227)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.3333
Median 1
Q3 1
95-th Percentile 1
Maximum 1
Range 1
IQR 0.6667

Descriptive Statistics

Mean 0.6957
Standard Deviation 0.3963
Variance 0.157
Sum 954248.5801
Skewness -0.8227
Kurtosis -1.0125
Coefficient of Variation 0.5696
  • special_offer_click_ratio is not normally distributed (p-value 2.7345707101431564e-23)

missing_category_click_ratio

numerical

Approximate Distinct Count 485
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.03885
Minimum 0
Maximum 1
Zeros 1190130
Zeros (%) 86.8%
Negatives 0
Negatives (%) 0.0%
  • missing_category_click_ratio is skewed right (γ1 = 4.7016)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0.3333
Maximum 1
Range 1
IQR 0

Descriptive Statistics

Mean 0.03885
Standard Deviation 0.1364
Variance 0.01862
Sum 53283.5591
Skewness 4.7016
Kurtosis 24.7365
Coefficient of Variation 3.5122
  • missing_category_click_ratio is not normally distributed (p-value 4.410474688404535e-25)
  • missing_category_click_ratio has 181509 outliers

brand_click_ratio

numerical

Approximate Distinct Count 75
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.001193
Minimum 0
Maximum 1
Zeros 1367264
Zeros (%) 99.7%
Negatives 0
Negatives (%) 0.0%
  • brand_click_ratio is skewed right (γ1 = 29.6523)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 1
Range 1
IQR 0

Descriptive Statistics

Mean 0.001193
Standard Deviation 0.02912
Variance 0.00084817
Sum 1636.8986
Skewness 29.6523
Kurtosis 943.8498
Coefficient of Variation 24.4039
  • brand_click_ratio is not normally distributed (p-value 4.226666087729338e-25)

main_category_click_ratio

numerical

Approximate Distinct Count 632
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.2651
Minimum 0
Maximum 1
Zeros 813145
Zeros (%) 59.3%
Negatives 0
Negatives (%) 0.0%
  • main_category_click_ratio is skewed right (γ1 = 1.0258)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.5
95-th Percentile 1
Maximum 1
Range 1
IQR 0.5

Descriptive Statistics

Mean 0.2651
Standard Deviation 0.3772
Variance 0.1422
Sum 363680.3142
Skewness 1.0258
Kurtosis -0.5875
Coefficient of Variation 1.4225
  • main_category_click_ratio is not normally distributed (p-value 5.797972388957594e-24)

price

numerical

Approximate Distinct Count 314
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16459668
Mean 78.9665
Minimum 0
Maximum 334998
Zeros 1327538
Zeros (%) 96.8%
Negatives 0
Negatives (%) 0.0%
  • price is skewed right (γ1 = 80.5058)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 334998
Range 334998
IQR 0

Descriptive Statistics

Mean 78.9665
Standard Deviation 1117.5974
Variance 1.249e+06
Sum 1.0831e+08
Skewness 80.5058
Kurtosis 15316.3949
Coefficient of Variation 14.1528
  • price is not normally distributed (p-value 4.227259194956075e-25)
  • price has 44101 outliers

quantity

numerical

Approximate Distinct Count 18
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 12344751
Mean 0.04108
Minimum 0
Maximum 30
Zeros 1327538
Zeros (%) 96.8%
Negatives 0
Negatives (%) 0.0%
  • quantity is skewed right (γ1 = 26.472)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 30
Range 30
IQR 0

Descriptive Statistics

Mean 0.04108
Standard Deviation 0.305
Variance 0.09303
Sum 56348
Skewness 26.472
Kurtosis 1619.5796
Coefficient of Variation 7.4244
  • quantity is not normally distributed (p-value 4.4193897523179615e-25)
  • quantity has 44101 outliers

buy_hour

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.5016
Minimum 0
Maximum 23
Zeros 1320833
Zeros (%) 96.3%
Negatives 0
Negatives (%) 0.0%
  • buy_hour is skewed right (γ1 = 5.671)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 23
Range 23
IQR 0

Descriptive Statistics

Mean 0.5016
Standard Deviation 2.7072
Variance 7.3291
Sum 688050
Skewness 5.671
Kurtosis 32.2118
Coefficient of Variation 5.3969
  • buy_hour is not normally distributed (p-value 4.24090279429147e-25)
  • buy_hour has 50806 outliers

buy_minute

numerical

Approximate Distinct Count 60
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 1.0966
Minimum 0
Maximum 59
Zeros 1321566
Zeros (%) 96.4%
Negatives 0
Negatives (%) 0.0%
  • buy_minute is skewed right (γ1 = 6.5452)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 59
Range 59
IQR 0

Descriptive Statistics

Mean 1.0966
Standard Deviation 6.5072
Variance 42.3431
Sum 1.5041e+06
Skewness 6.5452
Kurtosis 44.0409
Coefficient of Variation 5.934
  • buy_minute is not normally distributed (p-value 4.227285457330005e-25)
  • buy_minute has 50073 outliers

buy_day

numerical

Approximate Distinct Count 16
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.5558
Minimum 0
Maximum 19
Zeros 1320759
Zeros (%) 96.3%
Negatives 0
Negatives (%) 0.0%
  • buy_day is skewed right (γ1 = 5.1372)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 19
Range 19
IQR 0

Descriptive Statistics

Mean 0.5558
Standard Deviation 2.8855
Variance 8.3261
Sum 762400
Skewness 5.1372
Kurtosis 24.9488
Coefficient of Variation 5.1913
  • buy_day is not normally distributed (p-value 4.269614333152265e-25)
  • buy_day has 50880 outliers

buy_dayofweek

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 93271452
  • The largest value (0.0) is over 93.69 times larger than the second largest value (6.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 5.0
3rd row 5.0
4th row 5.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2743278
  • The top 2 categories (0.0, 6.0) take over 50.0%
  • The largest value (00) is over 93.69 times larger than the second largest value (60)
  • buy_dayofweek has words of constant length

buy_is_weekend

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 93271452
  • The largest value (0.0) is over 59.64 times larger than the second largest value (1.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 1.0
3rd row 1.0
4th row 1.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2743278
  • The top 2 categories (0.0, 1.0) take over 50.0%
  • The largest value (00) is over 59.64 times larger than the second largest value (10)
  • buy_is_weekend has words of constant length

buy_month

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 93271452
  • The largest value (0.0) is over 25.96 times larger than the second largest value (8.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 8.0
3rd row 8.0
4th row 8.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2743278
  • The top 2 categories (0.0, 8.0) take over 50.0%
  • The largest value (00) is over 25.96 times larger than the second largest value (80)
  • buy_month has words of constant length

total_amount

numerical

Approximate Distinct Count 718
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16459668
Mean 91.3893
Minimum 0
Maximum 334998
Zeros 1327538
Zeros (%) 96.8%
Negatives 0
Negatives (%) 0.0%
  • total_amount is skewed right (γ1 = 67.1992)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 334998
Range 334998
IQR 0

Descriptive Statistics

Mean 91.3893
Standard Deviation 1210.5806
Variance 1.4655e+06
Sum 1.2535e+08
Skewness 67.1992
Kurtosis 11286.0649
Coefficient of Variation 13.2464
  • total_amount is not normally distributed (p-value 4.2275795348072765e-25)
  • total_amount has 44101 outliers

session_buy_count

numerical

Approximate Distinct Count 30
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.1512
Minimum 0
Maximum 48
Zeros 1320759
Zeros (%) 96.3%
Negatives 0
Negatives (%) 0.0%
  • session_buy_count is skewed right (γ1 = 9.6158)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 48
Range 48
IQR 0

Descriptive Statistics

Mean 0.1512
Standard Deviation 0.9734
Variance 0.9474
Sum 207389
Skewness 9.6158
Kurtosis 142.8625
Coefficient of Variation 6.4377
  • session_buy_count is not normally distributed (p-value 4.265607633392067e-25)
  • session_buy_count has 50880 outliers

total_amount_sum

numerical

Approximate Distinct Count 3133
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16459668
Mean 229.9656
Minimum 0
Maximum 334998
Zeros 1327538
Zeros (%) 96.8%
Negatives 0
Negatives (%) 0.0%
  • total_amount_sum is skewed right (γ1 = 34.3518)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 334998
Range 334998
IQR 0

Descriptive Statistics

Mean 229.9656
Standard Deviation 2175.5881
Variance 4.7332e+06
Sum 3.1543e+08
Skewness 34.3518
Kurtosis 2634.2359
Coefficient of Variation 9.4605
  • total_amount_sum is not normally distributed (p-value 4.237330583259869e-25)
  • total_amount_sum has 44101 outliers

time_diff

numerical

Approximate Distinct Count 49936
Approximate Unique (%) 3.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 24.4207
Minimum 0
Maximum 1799.877
Zeros 1320759
Zeros (%) 96.3%
Negatives 0
Negatives (%) 0.0%
  • time_diff is skewed right (γ1 = 7.2827)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 1799.877
Range 1799.877
IQR 0

Descriptive Statistics

Mean 24.4207
Standard Deviation 147.4985
Variance 21755.8184
Sum 3.3496e+07
Skewness 7.2827
Kurtosis 58.7129
Coefficient of Variation 6.0399
  • time_diff is not normally distributed (p-value 4.229171921836806e-25)
  • time_diff has 50880 outliers

conversion

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (0) is over 30.1 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 30.1 times larger than the second largest value (1)
  • conversion has words of constant length

conversion_session

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90528174
  • The largest value (0) is over 12.22 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1371639
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 12.22 times larger than the second largest value (1)
  • conversion_session has words of constant length

avg_time_diff

numerical

Approximate Distinct Count 22833
Approximate Unique (%) 1.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 63.1056
Minimum 0
Maximum 1799.849
Zeros 1251762
Zeros (%) 91.3%
Negatives 0
Negatives (%) 0.0%
  • avg_time_diff is skewed right (γ1 = 4.168)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 604.3485
Maximum 1799.849
Range 1799.849
IQR 0

Descriptive Statistics

Mean 63.1056
Standard Deviation 233.9134
Variance 54715.4691
Sum 8.6558e+07
Skewness 4.168
Kurtosis 17.914
Coefficient of Variation 3.7067
  • avg_time_diff is not normally distributed (p-value 4.240817064513237e-25)
  • avg_time_diff has 119877 outliers

conversion_rate

numerical

Approximate Distinct Count 367
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 21946224
Mean 0.02049
Minimum 0
Maximum 9
Zeros 1320759
Zeros (%) 96.3%
Negatives 0
Negatives (%) 0.0%
  • conversion_rate is skewed right (γ1 = 9.7547)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0
Maximum 9
Range 9
IQR 0

Descriptive Statistics

Mean 0.02049
Standard Deviation 0.1237
Variance 0.01531
Sum 28099.2201
Skewness 9.7547
Kurtosis 231.7982
Coefficient of Variation 6.0397
  • conversion_rate is not normally distributed (p-value 4.285409388655764e-25)
  • conversion_rate has 50880 outliers

Interactions

Correlations

Missing Values